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(57) Two-dimensional barcodes, each having en- 
coded digital information in a bitmap representing pref- 
erably randomized encoded data bits, are printed onto 
a printed medium. Preferably, error correction codes are 
added to the digital information to ensure, that the de- 
coding process accurately reproduce the digital infor- 
mation. In one embodiment, the bitmap may further in- 
clude "anchor" bits in each comer, which are used as 
part of the skew estimation and deskewing processes 
during decoding. In a second embodiment, no "anchor" 
bits are required. The encoded digital information is 
mapped into the two-dimensional barcode in such a way 
as to minimize the errors caused by damage to particu- 
lar rows and/or columns, for example, row damage 
caused by taxing the printed barcode. To extract the en- 
coded digital information from the printed medium, the 
printed medium is scanned, then the bitmap is located 
within the printed medium. The skew of the bitmap, if 
any, is determined, and the bitmap is deskewed if nec- 
essary. The bitmap is then cropped, and the randomized 
digital information is read from the bitmap. The digital 
information is derandomized and any error correction 
codes are removed, in the process correcting and/or re- 
cording any errors discovered, thereby reproducing the 
original encoded digital information. 
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Description 

Field of the Invention 

[0001] The invention relates generally to two-dimen- s 
sional barcodes, and, more particularly, to a robust clock 
free two-dimensional barcode symbology, encoding in- 
formation formatted in such barcode symbology, print- 
ing the barcode symbology onto a printed medium, and. . 
decoding the same. io 

Background of the Invention , 

[0002] , Contrary to the frequent predictions that we, will, 
one day live in a "paperless society r, paper, and. other yis 
printed mediums, are playing an increasingly important ; 
role as an inexpensive, effective and convenient meansr > 
for communication. A fundamental limitation with paper, 
however, is that from a computer standpoint, it ..is cur. 7 ? 
rent ly. an, output-only format. .While,paper may be the ; . 20 
preferred medjum for displaying information fonhuman 
use, it is difficult, if not impossible, for a computer to re- -y 
cover data reliably once it has . been printed. Optical .v 
character recognition (OCR) attempts to solve this prob- 
I em in a relatively simple domain, such as text rendered , 25 
using standard fonts, but has met. with only limited suc- 
cess thus far. While accuracy rates of ninety-nine (99%) 
percent areperhaps achievable and may seem impres T 1 , 
sive, a page with 3,000 characters will still incur an av- , . 
erage of thirty (30) OCR errors and hence requires ex- , ,30 
pensive and time consumingmanual post-processing. \ 
[0003] Another approach uses computer, .readable 
barcodes which may be included directly on paper (or . 
other printed medium such as microfilm). Once encod- : . 
ed, such barcodes can be used by the computer to re-.. 35 
cover information evident to thejiuman reader but diffi-r v 
cult for a computer to recognize (e.g., printed text), in- , 
formation implicit to the creation of page but essentially 
invisible to the human reader (e.g., spreadsheet formu- 
las), or any other information desired^whetheror not de-,. \40 
pendent on the actual character text on the paper. 
[0004] Computer readable barcodes, wherein digital . 
data is recorded directly on paper, are known and have. , 
been utilized to provide document or product identifica- 
tion given a fixed set of values using simple numeric en- 45 
coding and scanning technologies. -Document or prod- 
uct identification systems which have been employed in . 
the past include barcode markers and scanners which 
have found use in a wide range of arenas. With respect - 
to paper documents, special marks or patterns in the so 
paper have been used to provide information to a related 
piece of equipment, for example the job control sheet - 
for image processing as taught by Hikawa in U.S. Patent 
No. 5,051,779. Similarly, identifying marks comprising 
encoded information have been printed on the face of ss 
preprinted forms as described in U.S. .Patent No. . 
5,060,980 to Johnson, et at. The Johnson, et al. system 
provides for a user entering hand drawn information in 



the fields on a paper copy of the form and then scanning 
the form to provide insertions to the fields in the dupli- 
cate form that is stored electronically in the computer. , 
Still another system, is described in U.S. Patent No. 
5,091 ,96.6 of Bloomberg, et al., which teach the decod- 
ing of glyph shape codes, which codes are digitally.en- 
coded data on paper.- The identifying codes can be read 
by a computer and thereby facilitate computer handling 
of the document, *such as identify ingi. retrieving and. 
transmitting such document. . . 0 : ^ ; v-; S! f 
[0005] Besides j the various * shaped, barcodes . cter 
scribed above, .two-dimensional- barcodes called: "data 
strips", having a plurality of rows of "data lines-" that rep- 
resent information digitally encoded on printed media 
are also known in the art. Each data, line row consists of 
a series of black and white pixeis.each representing bi- 
nary : "Q"s and ( B 1 "s, Jhe ordering of the bits in each- row . 
detenmines^the^digitalrdata- stored jherein^The data 
stored with ifl^the totality of* the rows define the data con- 
tained in the 1 word imensional.barcpde. Typically, to read . 
the,barcode, the user passes a. hand scanner, which si- 
multaneously reads , the, information in each . data line 
row, vertically along the length of the barcode to read all 
of the data line/ rows. ..,. . t ^ . . .....■-,! ' . 

[0006], f ^n example of. a prior art system using a data 
strip two7dimensiqnal barcode having rows of data lines 
with . paper-, media, > is found in U.S. Patent -Nos. 
4,692,603 : 4,-754,127 and 4,782,221 of Brass, ,et al. In 
this system, Uvordimensional barcodes consist of data 
line rows which are used to encode computer programs 
and data, on paper and.are- scanned by use. of a hand 
scanner. In addition to encoding the computer programs 
and data, these data lines also contain tracking and syn- 
chronization bits, hereinafter, referred to as "clock bits?. 
The requirement for use. of numerous clock bits directly . 
within . each . data >.Une- row* significantly reduces the 
amount of digital data that can be stored within each row. 
Further, if datajine rows having clock bits are damaged, 
which is- common, if such. barcodes are photocopied or 
i transmitted by ; facsimile systems; such clock bits would 
be lost making; it difficult, if not impossible, to decode the 
information encoded jn the barcode. . , 
[0007] ,U.S..-patenti No. 5,083,21 4 ; of Knowles de- 
scribes a two-dimensiongkbarcpde system that also re- 
quires embedding clock,bits within the encoded data it- 
self: However r instead of encoding every row and col- 
umn of data with synchronization and clock bits, the 
Knowles '214 Patent, uses the uppermost row in a data 
barcode to provide synchronization during scanning. 
Further, the first eight left most columns of bits comprise 
clock bits .that are- used fo^establishing the clocks, to • 
read the data bits contained in each row. Since the up- . 
permost row and all . eight leftmost columns of the two- 
dimensional barcode, described in the Knowles '214 
Patent, are used solely for synchronization and clock- 
ing, the overall data capacity of the barcode is propor- 
tionally decreased. Furthermore, if the uppermost row 
or left most columns of the barcode are damaged, clock 
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information will be lost making it difficult; if not impossi- * 
ble, to read out the data encoded in each 'data row as 
with other prior art systems. 

[0008] U.S. Patent No: 4,924 ; 078 to Sant'Anselmo et 
al. describes a two^iimensionafbarcbde 'system in' s 
which an orientation and/or timihg'ceil bdrder'is included : 
within the body^ofthe barcode' itstelf.' For -'all' cases in 
which the exact Ibcatibh;' symbol size ^a'nd" data density 
are not knbwrt in advance of reading the barcode, the 
border is required by the Sant-AhselmoWS Patent and 10 
must be smcoth ; ahd on at least two sides of the data 
area. In addition, timing cells are required when certain 
information 1 is'flbt a priorhknown, such as the .number of- 
cells perside. Here^gaihy because i the orientation and/" 
or timing cell border is ! included within the barcode itself, is 
the bveralFdata : cfapacity l of the barcode is* proportionally 
decreased. F iJ rtti e rrhb re', b eca use * c ritical inf brmation 1 1 ■ 
about decoding the' barcode is ei)rt?acted f frorn thVbrien-'' 1 : ' 
tation border, darna^e ? td a single row 6r ! cblurf»n :, df thi§- 
embodiment of'the Sant'AnsefmB '078 Patent -in which"' so 
the border is present oh only twb "sides -also- makes'tt*''' 
difficult, if not irnpossible' thread but ] the 1 data * encoded : 
within the barcode. *' ■■ ""' ■ - wv. 

[0009] In co-pending patent applicatioh j n A t Cibcfc : Pree ! 
Two-Dimensional Barcode and Methc& fbr Printing aha" 7 ' 2s 
Reading the' Same", (Serial No. 08/569", 280; r filed* DW • 
cember 8, i 995) ("the '280 Application-) now published' - 
as EP-A-07831 60, the -contents of which' are explicitly" ' 
incorporated by reference herein, a clock-less two^'di- ' 
mensional bkrcode with a border on at least one of the * -30 
four sides of the barcode' is described, which border is- 
placed outside the confines oTthe barcode itself.-The 
two-dimensional barcodes are called "PanaMarks"®; • 
As depicted in FIG. 1 A, PanaMark -10 is printed in the 1 
low right hand corner of printed page TV, although this* ' 35 
position is completely arbitrary-^ the embodiment de-' - 
picted in FIG. 1 A, the remaining portion of printed page ; • 
11 is occupied by printed 'text 12. Howeverr'as one ' 
skilled in the art will appreciate; any type of computer- > 
generated printed material, for example- a spreadsheet -*40 
or graphics, can b & substituted' for r the printed text 12. 
The PanaMark 10 depicted'iri FIG! IB includes ai border * 
1 3 that is present bn all'f our of its sides; Although, as is 
fully described in the *>B0 Application, the border 13 is 
only needed on one of thefour sides of thePari~aMark ;i 
10, for aesthetic reasons it -is typically included ori all ~* 
four-sides. ! — ■* 1 " * w '•■ ■ 

[001 0] It is therefore an object of the present invention * 
to provide a clock free two-dimensional barcode J repre-^ 
sent in g digitally encoded data. m • so 

[0011] It is an additional-object of the- present inven- 
tion tb provide a border-less clock free two-dimensional 
barcode representing digitally encoded 1 data. " 
[0012] Another object of the invention is to provide a 
border-less clock free two-dimensional barcode printed' ss 
on a medium that is sufficiently robust to withstand dam- 
age to the medium.' 1 
[001 3] A further object of the invention is to provide a 



border-less clock free two-dimensional barcode repre- 
senting digitally encoded data, without the need for any 
internal blocks used for alignment purposes. 
[0014] It is yet another object of this invention to en- 
code information formatted in a border-less clock free 
two-dimensional barcode for printing on a printed medi- 
um. . 

[0015] ' An additional object of this invention is to pro- 
vide a method of decoding information digitally encoded 
in the form of a border-less clock free two-dimensional 
barcode printed on a printed medium. 
[0016] Various other objects, advantages and fea- 
tures of the present invention will become readily appar- 
ent'frbm the ensuing detailed description and the novel 
features will be particularly pointed out in the appended 
claims: ' • 

[0017] These and other objectives are realized by a 
two-dimensional data barcode having encoded digital 
information in a bitmap formatted representation of ran- 
* dorriized l da'ta bits' having a selected pixel row and col- 
umn length. Preferablythere is no border surrounding 
the barcode: 

[00*18] -'In one embodiment of this invention; a two-di- 
mertsional data barcode^has encoded digital information 

■ in a) bitmap formatted representation of randomized data 
bits having'a selected pixel row and ! column length, with 
"black" data pixels in each comer and no border sur- 
rounding the barcode. In another embodiment of this in- 
vention, there is'ho requirement that "black" data pixels 

^occupy each corner of the barcode. 
[0019] '-Advantageously each data bit "within each 
block in* said randomised encoded information is sepa- 
rated from every other data bit within the same block by 
at least one row and one column in said two-dimensional 

*" barcode! - '■■ ■ 
[0020] - 'As a further aspect of this invention, a method 
of encoding information, for printing on a printed medi- 
um, sudh as paper, is described. Information is first con- 
verted into 1 a data stream having a sequence of binary 
characters 'representing data bits. The data stream is 
then randomized 4b -produce approximately an equal 
nurhber of "0" and "1" characters. Thereafter, the rand- 
omized data stream is formatted into a two-dimensional 
bitmap of rows and columns and printed onto the printed 
medium. ' ■ ; ; : ' 

[0021] In- another aspect of the invention, information 
is converted into a data stream having binary characters 
representing data bits. The data stream is mapped to 
produce digital data bits arranged in a* two-dimensional 
bitmap of rows and columns having the four comer lo- 
cations vacant. The four corner locations are set to the 
logical character representing a "black 0 data pixel. 
[0022] Another aspect of this invention relates to a 
method for mapping the data stream representing the 
information to be encoded to produce digital data bits 
arranged in a two-dimensional bitmap of rows and col- 
umns such that each data bit of a given block is sepa- 
rated from every other data bit within the same block by 
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at least one row and one column, which ensures that 
the bitmap is robust, with respect to row and, column 
damage. 

[0023] As a further aspect of this invention, a method 
for ensuring that the information encoded into the two- s 
dimensional barcode is adequately randomized is pre- 
sented. A number of alternate keys are available for ran- 
domization during the encoding process, so that if a first 
key does not produce an adequate level of randomiza- 
tion, a second key can be used. This process of testing io 
additional keys continues until the randomization reach- 
es a predefined level. During decoding, the error cor- 
recting code is used to select the most appropriate key 
for derandomization, as the improper key will produce 
excessive errors. i . . i f' 1 . 5 

[0024] Preferably, in all embodiments of the twodh 
mensional barcodes of this invention, the .encoded in-, ; 
formation printed on the printed medium is .randomized . .. 
to safeguard against, strings of contiguous pixels being 
the same color which may occur without randomization. 20 
[0025] In another embodiment of this invention, inforr 
mation printed, on a printed medium in the form of a two- 
dimensional bitmap of rows and columns of data pixels r 
representing encoded data bits is decoded. The printed t 
medium contains other information in addition to the bit-*., 2S 
map. The printed medium on which the bitmap is printed , 
is scanned, and the bitmap is. located with in the printed 
medium. The skew, angle of the bitmap . is determined 
with respect to the origin of the scanned printed medium, > 
and if the angle is too large, the bitmap is deskewed. 30 
Once the bitmap is correctly aligned; it is cropped and < 
the binary data from the data pixels is read to produce 
a two-dimensional,digital array corresponding to the en- , , 
coded data bits and comer bits, if present, respectively. 
[0026] In a lurther embodiment of this aspect of this 35 
invention, a method of ensuring that the data is properly , 
read from the deskewed, cropped bitmap is presented, f . 
whereby four alternate data streams are created by . , 
reading the data in four alternate directions, and the er- 
ror correcting code is then used to select the data stream . 40 
with the fewest errors for output. ■ , 

Brief Description of the Drawings 

[0027] The following detailed description, given, by 
way of example and not intended to limit the present in-. • 
vention solely thereto, will best be understood in con- \ 
junction with the, accompanying drawings in : which; 
[0028] Figure .1 A is a diagram schematically illustrat- .. 
ing the two-dimensional barcode of the '280 Application so 
printed on a-page of .printed text,. and Figure 1B shows 
an example of the two-dimensional barcode of the '280. 
Application. 

[0029] Figure 2 shows an example of. a two-dimen- 
sional barcode in accordance with the present invention. 55 
[0030] Figure 3 is a flow chart showing the steps for 
encoding and decoding information onto a printed me- 
dium in accordance with the present invention. 



[0031] Figure 4 shows an example of a bad. data bit . 
layout for the .two-dimensional barcode. 
[0032] Figures 5A andSB show an example of a better 
data bit layout ; for the two-dimensional barcode. 
[0033] Figure 6 shows an example of optimal, data bit 
layout for the two-dimensional barcode in accordance 
with the present. invention. , 

[0034] Figure 7 A shows an example of a. randomized 
two-dimensional barcode with "BAD"; randomization re- 
sults, and Figure 7B shows an example of a randomized 
two-dimensional barcode, with "GCKDD" .randomization 
results. . ....... , r ; - 

[0035] Figure.8A is a flowchart, describing how to read 
one embodiment of. the two-dimensional barcode in ac- 
cordance with the present invention, and Figure 8B is a 
flowchart describing how to read a second embodiment 
of the. two-dimensional barcode in accordance, with the 
present invention. :; . .-, . , . . , . r„ -.. . 
[0036] , Figures, 9A. and 9B ^illustrate. J how the two-di- 
mensional barcqde is, located, with in. the printed page in 
accordance with the present invention^ 
[0037] .Figures, .1 OA. and 10B illustrate how the skew 
of thejwo-dimensional barcode is calculated in accord- 
ance wjth ; pne,aspect of the present invention . . . 
[0038] .. Figure. 11. illustrates a parametric representa- 
tion of, a straight line, . . . 
[0039] Figure 12A and 12B .illustrate representations 
of a single -point within a^plane, and two points within a 
plane, .respectively. • . < 
[0040] Figures. 13A and 1.3B show the alignment of 
edge pixels < in ideal and. skewed two-dimensional bar- 
codes of the, present invention. , 
[0041] Figures ,14A and 14B illustrate the four edges 
of a PanaMark segment and the two masks used to de- 
tect the edge pixels in;the Hough Transform-based skew t 
detection procedure in accordance with one aspect of 
the present, invention.. 

[0042] , Figure 15 A shows a skewed two-dimensional 
barcodeand Figure 1 5B shows the edges detected with- 
in the two-dimensional barcode of Figure. 15A by using 
the Hough Transform in accordance with the present in- 
vention. ( , .r ; v - - : - ;i • 

[0043] Figure. 13 illustrates the simple deskewing 
process for the two-dimensional barcode in accordance 
.with of the present invention. < 
[0044] Figure ; 1 7 shows two ways of applying, skew, 
correction to the .two-dimensional barcode in accord- 
ance with the, present invention, 

[0045] . Figures 1 8 A- 18D.il I gst rate the four alternate di- 
rections for : reading the data from the two-dimensional 
barcode in accordance, with the present invention. . , 

Detailed Description of the Preferred Embodiments . 

[0046] As fully described in U.S. Patent Nos. 
5,625,721 and 5,703,972 to Lopresti et al. which are 
both entitled "Certifiable Optical Character Recognition" 
and in the co-pending patent application entitled "A 
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Method and Means For Enhancing Optical Character 
Recognition of Printed Documents" (Serial No. 
08/1 38,467, filed October 15, 1993), currently under al- 
lowance, the contents of which are all explicitly incorpo- 
rated by reference herein, information' about the con- s 
tents, layout, generation and retrieval'df a document can 
be encoded by a computer when initially generating the 
document or 1 upon" subsequent computer processing 
thereof. The encoded 'document information'tan then 
be provided'via a data barcode, -Which is also referred io 
to herein as' a "PanaManV®, gene'rated'onHhe face of 
a printed version of the document. Advanced encoding 
and print resdiuti oh capabilities presently available can 
accommodate' up io 30, GOG bits of infdrmati6n tn : a : single 
square inch of spacerThere'fore; as taughtby'the above- * *5 
referenced applications, one can theoretically encode " 
the : entfre document cohterits, h: limit'ed dhly • by the 1 - 
amount of space on the document face that oh&i's willing" 
to sacrifice to the PariaMarki A barcode* scanner? in con- * 
junction with or ■'whoTly' separate 'frorri ran optical 'page ' 20 
scanner, can scan the PanaMarK : arid provide the irifbr-' : 
mation to ah associated" system equipped" with* the a : p- 
propriate recognition and' 'decbdirig software. ] The 'tie-''' ' 
coded information can : then be used by : the : scahning ; - ' 
system to-'create a new version' of the'document'oriS/ ^ 2S 
enhance the recognition, reproduction and error correc- M 
tion lor the scanned 'document. To decode ttne Pan- u 
. aMark, it is not required that such barcode scanner and 
scanning system know the printing resolution of the- 
Pa na Mark; provided that the scanning resolution of the *3b 
scanner is at least as fine as the printing resolution of * 
the PanaMark, so that each individual 1 or larger - 
pixel of the PanaMark, representing a logical bit, can be 1 
scanned. ' "' ' • ■ • * -v r 

[0047] The information encoded in the form of a-Pan- »■ 35 
aMark can be used to enhance the software tools al- y 
ready used to create paper documents. Examples in- 
elude word processors/ spreadsheets, object-oriented* 
graphics, 1 and multimedia applications, such as voice re- 1 
cording and photocjraphic imaging. ; - ; - ■ - 4o 

[0048] The border 13 "used in the r PanaMark*1 Oof FIG.' 
1 was not a critical feature of the invention disclosed in 
the '280 Application, as most of the key procedures de- 
scribed therein work whether or not a border is present. 
However, the border 1 0 was used "in the '280 Application 45 
by the skew estimation and deskewing steps. 
[0049] FIG. 2 illustrates an example of the new Pan- 
aMark symbology. PanaMark 20 comprises an encoded' ■ 1 
set of data bits in a two-dirhehsional grid; Typically, 'each ' * = 
data bit which is encoded is printed as a matrix of black so 
or white pixels 23. Preferably, a pixel matrix represent- 
ing one data bit is square and may be as small as a 1 x 
1 matrix to as large as a 6 x 6 matrix or more:. Non-. 
square matrices may also be used. There are no clocks 
or borders needed or required in the symbology for Pan- '* ss 
aMark 20. In the preferred embodiment, the PanaMark 
20 is a 20 x 20 array of data bits, although it can be 
recognized that the size is flexible 1 and that the only re- 



quirement on the size is that the reading process know 
the size of the encoded array. In some applications, it 
may be desirable to include a text label 22 adjacent to 
the PanaMark 20, although optional and of no signifi- 
cance to either the encoding or decoding process. 
[0050] Two different embodiments are described 
herein. In the first embodiment, the four corner bits 21 
are always black and are necessary for the template 
matching skew estimation process discussed in greater 
detail with respect to FIG. 10 The four corner bits 21 in 
the first embodiment are called "anchor" bits. The re- 
maining data bits in the first embodiment are pseudo- 
randomized and can hold any combination of desired 
information and error correction bits. The symbology of 
the first embodiment provides for good skew estimation 
when the skew is small and the PanaMark 20 is free 
from an^ damage. However, the placement of the an- 
chor Bits 21 in- the corner makes them susceptible to 
damage. Thus, in the '-second embodiment, there is no 
requirement for any "anchor bits 21 and the PanaMark 
20 is simply a N x M array of data bits, preferably with * 
N = M = 20. In the second embodiment, all of the data 
bits are pseudo-randomized and can hold any combina- 
tion of desired information and error correction bits. 
' [0051 ] Figure 3 illustrates the steps involved in the en- r c - 
coding/decoding process. Except as discussed herein, ' 
each of ihe steps are more particularly described in the *' 
'280 Application. During the encoding process: input da- ' T 
ta in the form of a ohe-dimensional linear bitstream is * 
first processed to add a standard, block-based Brror cor- 
rection codeCECC") at step 30, randomized at step 31 , - 
mapped from a one dimensional- bitstream to a two-dt- ^ 
mensional representation, the PanaMark, at step 32, r * 
and the PanaMark is finally printed at step 33. The* de- 
coding process repeats these steps in reverse order, the 
printed PanaMark is read at step 34, mapped from a' iv * 
two-dimensional to one-dimensional representation in 
step 357 de randomized at step 36, and finally the ECC 
is applied at step 37 to recreate the "raw" linear bit- 
' stream. As part of the" new PanaMark symbology pre- 
sented herein, improved methods for randomizing," * 
mapping, reading, and applying the ECC are also pre- 
sented. 

[0052] Any ECC can, ol. course, be implemented in a 
manner which allows some number of bit errors to be 
corrected. While such errors are generally unavoidable, . 
the method by which data is mapped into two dimen- 
sions can be chosen to minimize the impact of certain 1 
kinds of errors on the readability of the PanaMark as a 
whole. The bits are mapped so that systematic damage 
to the mark is not likely to cause niore than one bit error 
in each block, or codeword, which is what the ECC can 
correct. 

[0053] One method, shown in FIG. 4, of placing the 
bits from a single block 40 sequentially in row- or col- 
umn-major order within barcode 41, is a poor choice. 
When an image is scanned, it is not uncommon to see 
correlated damage along specific rows or columns of 
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pixels, as depicted in row 42. In the case of flatbed scan- 
ners and fax machines, row damage arises as a. result 
of physical motion; either the'CCD sensor array being 
moved down the page, or the page being drawn past 
the stationary sensors. Any irregularity in this motion will . 5 
lead to distortion across certain rows of scanned image. , 
Notethat row 42 experiences seven bit errors and hence 
is totally lost. On the other, hand, column damage can , 
occur when certain of the CCD sensors have gone bad, 
leading to black stripes running down the length of the 10 
page. .... 
[0054] As shown in FIGS. 5A and 5B, a much better 
strategy is to. place successive bits from a single block ^ 
along diagonals within barcode 45. lnthis case I .only one 
bit in each block is affected by damage to correlated row is 
46 or column 47. Since the ECC can correct one bit error 
per block, this means that all the blocks are decoded ( t ■ 
successfully. Bymapping the blocks so that the damage. . 
is distributed uniformly across them, the overall robust-, . 
ness of the symbology is improved. . 20 

[0055] . It is. even more desirable if bits from a single . 
block can be placed in non-adjacent rows and columns, 
because the noise effects described earlier could poten- . 
tially straddle row or column boundaries. As long as the?.,, 
basic block size is less than half the dimension of the ?s 
mark, this is possible. This is accomplished by the map- : 
ping process depicted in FIG. 6. The barcode 50 illus- 
trated in FIG. 6 shows how block zero 51, is mapped on 
the major diagonal of logical PanaMark 50 with each bit ... 
separated by at least one row and one column. Hence, 30 
even damage that destroys two adjacent rows orcol- 
umns will not, by jtself, prevent the correct decoding of . t 
all of the blocks. Although FIG. 6 depicts a logical -Pan- 
aMark 50 with four anchor bits 52, it will be recognized ; 
that the anchor bits 52 are only necessary when the tern- . 35 
plate matching skew angle detection method is used, as. , 
is discussed herein with respect to FIGS, 10A and 1 0B. . . 
[0056] In exchange for doing away with. explicit clock 
borders, the reading^ procedure described in the '280 
Application requires the presence of sufficient edge 40 
transitions to synthesize the implicit clocks. Large areas, , 
of all-white or all-black pixels within the PanaMark are. 
problematic, because of the consequent reduction in the 
number of edges. In practice, such regions might arise 
due to regular patterns in the underlying data unless 45 
steps are taken to prevent it. For instance, English text 
represented in standard 8-bit ASCII always has a '0' in 
the high-order bit position. FIGS. 7A and 7B give exam- 
ples of a "bad". PanaMark 60 and a "good 0 PanaMark 
61 , respectively. Both PanaMarks 60 and 61 are shown so 
with the optional text label 62. Note that PanaMark 60 ~ 
is perfectly valid - it encodes the same number of bits 
as PanaMark 61. PanaMark 60 would, however, be im- * 
possible to read reliably using the techniques presented . 
in the '280 Application. . .. 55 
[0057] Such situations can be prevented from occur- 
ring by randomizing the input bitstream. This means , f 
that, on the decoding side, the bitstream must be de- 



randomized. This , process should actually be called 
"pseudo-random izatipn" because it is always Known 
how to undo its effects, A simple, well-known approach . 
to implement randomization is to perform an exclusive 

. 8 OR n .{ n XOR B ) operation on the input stream. bjt-by : bit 
with a predefined, randomly generated key., this has the 
effect of breaking up any regujar patterns in the. input 
stream, yielding a/oughiy uniform djstribution of 0's and . 
1's. When encpdecjjn the PanaMark^ these .correspond 
to white and black bits, respectively.. In, this. way Jthe 
"good" PanaMark 61 is created which has sufficient «dg- . 
es for the implicit clocking necessary to read the data 
bits. The randomization, step aisq gives the mark a non- .. 
descript. gray appearance that users, may find, less dis-.. 

- trading than readily recognizable, patterns. . 
[0058] -During the decoding process, t the output bit- 
stream js, de-randomized by XOR'ing it with.exactly the . 
same. key.. .Hence* it is vital that.the encoding : and de- 
coding procedures use the. same key. It ,i$ possible, in 
rare cases, thaithe randomized.bitstream wjll be degen- 
erate such thai' it .contains ^jarge' white or black regions', 
as illustratqc! by the "bad' Pan^ark 60 of FIG. 7A. .To 
avoid.thjs scenario,)^ can t option- 
ally choose frprn among ,a small assortment. of possible 

.. keys. The selection is giiidedby the.quality i .(i.e. , ,the ran- 
domness) of the output bitstream. Even though the de- 
coding procedure. cannot know a pripri which key the 
encoding.side has used, it is a simple. matterto try each 
key and confirm, the, results using the output from the 
; ECC r pfOcess, Le., the correct key w'll beJbe one that 
produces the .fewest .number jOf errors from the ECC. 
process; - . ; , . 

[0059] FIGS. 8A and 8B are flowcharts illustrating the 
reading procedures. for the barcode symbologies pre- 
sented herein. The method described by FIG. 8A is di- 
rected to the embodiment of the PanaMark in which no 
border or anchor, bits^are present, while the method de- 
scribed by FIG. 8B is directed to the embodiment of the 
PanaMark which includes anchor bits, but no. border. 
Printing is handled by^a.complementary set of.routines. 
that are more straightforward as no image processing 
is involved, and are.fuJIy described in the '280 ; Applica- 
tion. : ''. 
[0060] In FIG. 8A, the first step 70 locates the Pan- 
aMark, which has no border or anchor bits, on the print- 
ed page. Under most circumstances, it is assumed that 
the region around the mark is relatively noise-free; ordi- 
narily it is positioned, in an unused corner of the page 
(as,. for example, in FIG, 1 A). ; As more particularly dis- 
cussed with respect to FIGS. 9 A and 9B, the locate step 
70 operates by- first calculating vertical and horizontal 
histograms of the number of black pixels in the scanned 
page. Because the scanned PanaMark is randomized 
and therefore has an approximately equal distribution of 
black and white pixels, it will have a greater number of 
black pixels than is normally present within the region 
of the document in which it is positioned. Each histo- 
gram, is sequentially examined, row by row for the ver- 
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tical histogram and column by coiumri for the horizontal 
histogram, until a first predetermined level is reached. 
Next, the process "backs up sightly" until a second low- 
er predetermined level is reached! This is selected as 
the first boundary, then, moving forward from where the s 
first predetermined level was first found, the histogram 
is examined to find the 1 point where it drops belowthe 
first predetermined level, the process'riext 'moves for- 
ward slightly" until the second predetermined level is 
reached, similar to the "backing up" process described 10 
above. Thifpoirit is selected as the second boundary of 
the mark/ By perforrr.ing' this process on both the hori- 
zontal and vertical histograms, a box bounding the Pan- 
aMark is selected. The "backing up slightly" and "moving 
forward slightly"* steps ensure that the corners are not * 'is 
cut off, especially in the case of a' skewed PanaMark. 
[0061] " In step 71 of FIG. 8A,'the skew of the Pan- ' 
aMark is estimated using the "Hough Trahsiorm" proce- 1 ' - 
dure. This procedure is described in greater detail with' 
respect to FiGS. 11-13. In this procedure, horizontal 20 
edges are first found for each row '61 logical 1 data pixels' * 
By using the Hough Transform, a * representation for a 
line which is in parallel wfth the horizontal edges is found ] 
(which, if skewed, 'will not be purely horizontal), ahdthe 1 ' f 
angle between this line' and a purely horizontal line is '-25 
calculated, giving the skew aricjle. * - • 
[0062] At step 72, the skew angle is evaluated. If the x 
. skew angle $ is' below a certain predetermined threshold 
a, no deskewing is required, arid the cropping step '75* 
is performed next. If the skew angle <t> is greater than the ' 30 
predetermined angle a, then the trigonometric deskew ; 
step 74 is performed next. As is more particularly 'de- *'- 
scribed with respect to FIG*. 1 7, once the skew angle is 
known; it is possible to calculate the position' of the 
deskewed image mathematically 'based upon the 35 
skewedimage. ,..1 >. 

[0063] Once the deskewing, if any, is completed, at 
step 75 a tight bounding box is" cropped around the Pan- 
aMark, which' is now oriented with clean horizontal and 
vertical edges. The cropping step is performed in a man- *o 
ner much like the locate step' 70, 1 but* with tighter toler- ; 
ances. Finally, at step 76, the bits are read out, yielding ^ ! 
an N x M bit array (20 x 20 bit in the preferred embodi 1 
ment). The read process 76 is more particularly de- - 
scribed in the '280 Application and with respect to FIG. -*5 
18. • ■ • '> 

[0064] In FIG. 8B, the first step 70 locates the Pan- 
aMark, which has anchor bits but no border, bn the print- 
ed page, in the same manner as is generally described ' ' 
above and more particularly described with respect* to so 
FIGS. 9A and 9B. As previously mentioned with respect 
to FIG. 8 A, it is assumed that the region around the mark 
is relatively noise-free. . - *:.»••. 

[0065] In step 71, the skew of the PanaMarK is esti- 
mated. Two alternate methods for estimating the- skew ss 
are described herein. The first method is a : "template 
matching" procedure which is only effective for small an- 
gles of skew (it is not used in FIG. 8A because it requires 



that the PanaMark to be decoded have anchor bits in 
each corner), which is more particularly described with 
respect to FiGS. 10A and 10B. Generally, this process 
operates by aligning two templates closely to two adja- ■ 
cent corners of the PanaMark, for example, the left top 
corner and the left bottom corner. Each template forms 
a right angle and preferably has- an inner row of black 
bits and two outer rows of white bits. Each template is 
moved bit by bit towards the respective corner, until the 
closest possible match is found, aligning the inside row 
of black bits with the respective comer. Although the 
PanaMark may be skewed, each template remains 
aligned vertically and horizontally as it is matched with 
the respective corner. Once the closest match' is found 
for each corner, if the PanaMark is skewed, the black 
corner bits of each template will not be aligned vertically. 
The angle between the vertical (or horizontal, il the top 
left and top right comers were selected instead) line ex- 
tending from the corner bit of the first template;- and the 
line extending from the corner bit of the first template to 
the corner bit of the second template represents the 
skew angle, and can be ; calculated by simple trigono- 
metric methods' 

[0066] In the second method, the "Hough Transform" 
is used to calculate the skew angle. This method, which 1 " 
is generally described above with respect to FIG. 8 A and 
more particularly described with respect to FIGS. 11-13" 
is effective for a larger range of skew angles (and does' 
not require that the PanaMark have anchor bits). The" 
choice of template matching procedure is a tradeoff be- ; 
tween the amount of skew to be corrected and the>* 
processinglime allotted, because the Hough Transform"* 
procedure requires more processing time. If skew an-" 
gles greater than' can be detected using the template 
'matching procedure are expected, the Hough Trans- 1 
form method should be used. If processing time is at a 
premium, and smaller skew angles are expected, the 
template matching procedure should be used. In step 
72,' the skew angie is evaluated. If the skew angle 0 is 
below a certain predetermined threshold a, no deskew- 
ing is required, and the cropping step 75 is performed 
next. If the skew angle $ is greater than the predeter- 
mined angle p, then the trigonometric deskew step 74 
is performed next, as is generally described with respect 
to FIG. 8A : and more particularly described with respect 
to FIG . 1 7. If the skew angle <j> is between predetermined 
angles a and (3, then the simple deskew step 73 is per- 
formed next. The simple deskew method uses a shear 
rotation technique. In this process, one corner bit, for 
example, the upper left corner bit is assumed to be cor- 
rectly positioned. Based upon the skew angle, each row 
is sequentially examined to determine the number of 
pixels to be shifted, right to left for example: Each col- 
umn is likewise sequentially examined, to determine the 
number of pixels to be shifted, top to bottom for example. 
Next, the pixels are shifted as designated in each row, 
the finally the pixels are shifted as designated in each , 
column. - • : 
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[0067] As described with respect to FIG. 8A, once the 
deskewing, if any, is completed, a tight bounding box is 
cropped around the PanaMark at step 75 and the bits 
are r;ead out at step 76 : yielding an N x M bit array (20 
x 20 bit in the preferred embodiment). 
[0068] It is possible, at various steps in the. process, 
to check assertions as to whether a valid PanaMark has 
been detected. For example, the PanaMark is expected 
to fall within a certain size. range, and to obey specific 
properties with respect to its texture .(roughly 50/50 
black and white). In FIGS. 8A and 8B, error checking is , 
shown with respect to the locate step 70, the crop step 
75 and the read step 76. In later stages, after the read : 
step 76, the ECC can also be used to confirm validity. If 
at any point an expected characteristic is not ; found, 
there are two options: (1 ) back up and try a different ap- 
proach, or (2) abort further processing., Note that it is 
critical for the default steps to be very fast. Only when , 
special processing is required should more time be . 
granted to read the PanaMark. This is why, in FIG. 8B 
there are two alternate methods presented for perform- ( 
ing the deskew step, the trigonometric deskew step 74 
is used only when necessary, and normally the simple 
deskew step 73 is used, which operates more quickly 
than the more complicated trigonometric deskew step 
74. However, as shown in FIG. 8 A, for the PanaMark • 
without anchor bits, only the more complex procedures 
for estimating skew angle can be used. 
[0069] FIG. 9 illustrates the two steps, that are re-- 
quired to carry out the locate . step 70. The goal , of the , 
locate step 70 is to find a bounding boxfor the PanaMark 
that is assumed to be somewhere in the input image, for 
example, PanaMark 10 shown in FIG.1A in the. lower, 
right corner of document 1 1 . Normally, the PanaMark is 
the only object present at that particular position on the, 
page, other than an optional text label. In the event the 
PanaMark is embedded within the content of the image, 
or other large components are present in the input im- 
age, more sophisticated procedures based upon mor-.. 
phological operations and statistical analysis must, be 
used. Such cases are likely to be rare in. practice, how- 
ever During this process, it must.be anticipated that 
there will be a small amount of "speckle" noise, and that 
the entire image may be skewed. - 
[0070] The locate process illustrated in FIG. 9 is 
based on y-axis and x-axis projections. First, with refer- 
ence to FIG. 9A, the vertical extent (i.e., the top and bot- 
tom boundaries) of the PanaMark 90 is determined. To 
that end, the number of *black pixels in each row of the 
input image on which the PanaMark 90 is printed is 
summed, creating a y-axis projection 104. Processing 
proceeds from top to bottom on the y-axis projection 
104. In the embodiment shown, the label is below the 
PanaMark, although as one skilled in the art can under- 
stand, if present, it can be located on any one of the four 
sides of the PanaMark 90. A minium threshold 100 is 
set so that small amounts of spurious noise are ignored. 
When a row is found that meets the minimum threshold 



100, shown by.point 91, the threshold is reduced slightly t 
and the top boundary 1 02 is found by moving backwards 
in the histogram 104 until the reduced threshold is met 
(point 92). This "reducing 0 of the threshold 100 prevents 
5 the chopping off of the corners of the PanaMark, in the 
event that the mark were skewed and the threshold set 
too tight. Next, . processing continues downward, as 
shown in point 93, until the histogram- ,104 falls below 
■ the minimum threshold ,100. At this point, the.threshold 
10 is reduced by the same amount as before to establish 
the bottom boundary 103 at point -94, although here the 
processing^ continues downward, jqstead of moving 
backwards. . , - , r . .> . _ . 

[0071] Once the correct vertical extent has ; been de- 
is termined (effectively eliminating, the label), a similar.opr 
, , eration is. performed to determine tr^e, horizontal extent 
V ; (jje.,.the right ? and left boundaries) of PanaMark. 90. 
Black. pixels are summed for each column, across the 
currently- active rows to create x-axis. projection histOT 
20 gram ,101.. Rrpcessing f proceeds from .left, to right. At 
point. 95, .the processing stops. when the. minimum 
threshold level 100, is met The threshold is then reduced 
slightly, and the. left boundary 105 is selected by moving 
backwards along .histogram ..101 * until the,, reduced 
25 threshpld ; is metjat point 96. Then, processing continues 
from-point 95, where, the minimum threshold Jeyel 100 
was found, now looking for a level less than the thresh- 
old 100. Once this js found at point 97, the threshold Js 
again reduced by a set amount, and the right boundary 
30 106 is selected at point 98 by continuing to move left to 
right untiUhe reduced threshold is ; fouod. Of course, the , 
order of finding the.vertical and horizontal boundaries of . 
the PanaMark can be reversed. 
[0072] Afterthelocatingprocessiscompleted, itisasr 
35 sumed that an accurate bounding box for the PanaMark 
has been determined,, which , can be confirmed in a 
; number of ways,, e.g. ,, the ECC process. ; . 
-.v [0073] Once the PanaMark is located within the page, 
it is necessary to estimate its skew angle, if any. This is. 
40 required because later processing steps (Le., reading) 
depend, on the. mark being nearly , perfectly straight. 
Even though the user.may.be careful when placing the 
page on .the scanning device, small amounts of skew 
can still arise. Qnce.the amount of skew is known, a 
45 deskewing step can be applied to properly align the Pan- 
• ; aMark. Two alternate methods are presented herein for 
estimating the skew angle, as described with respect to 
block 71 of FIG. 8. The simpler method, which requires 
that the PanaMark include anchor bits, is the template. 
50 matching skew estimation .procedure. The template 
matching skew estimation procedure requires much 
less processing time than the more complex Hough 
Transform procedure, but only the Hough Transform : 
procedure can be used to calculate the skew angle of 
55 PanaMarks without anchor bits. 

[0074] FIG. 10 illustrates the template matching pro- 
cedure for skew estimation (step 71 of FIG. 8B), which 
works reliably for small skew angles, such as are en- 
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countered when the user exercises reasonable care in 
placing the page containing the PanaMark i 10 (with four 
anchor bits) on the glass of the scanning device, or when 
a scanning device incorporating an automatic sheet 
feeder is used 'to scan the page containing : the Pan- s 
aMark 1 1 0. ! As previously mentibned in conjunction with 
step 71 of FIG/ 8B,' sWw angles too large to be deter- 
mined by the template' matching procedure can be cal- 
culated usinlg the HbugK'Trahsform technique td be de- 
scribed in conjunction with' FrG'S: l l^r 10 
[0075] Referring! s t6 fr'GS. '10A' arid 10B,' templates ' 
111; 1+2 are 1 defined for the'nbrthwest (NW)- and south- 
west (SW) corners of the PanaMark 110, respectively. 
Each template 1 1-1 T, 1*12 preferably includes one row of 
black pixels' 117 arid two rows of white pixels 1 113. As : *5 
shown irVthe'lelt portion 1 of FIG. TO! each ternpia'te 111 , - 
112 is moved 'across the ajsprb^ !' 
aMark 11 0 in a'bttwise rrfannet to' locate the coiners. c ' 
Th is p rocess ' is -'indicate'd : by a f rows i i 8 sficrwin g [ the' 
movement of template "1 1 t toward the hortnwesi cornier, * J 
and by arrows 119 showing Ihe movement 'of template 1 
1 1 2 toward the southwest corner At eactVpdihtot move''-' ' ' 
ment, a 2-D Hamming distance" is cdnhputed' as id- well 
known in the art (i.e., the : number of 'pixel : lc«atlonfS'iri ''' 
which the template and the ' Imag'e do hot mat<jh)Vand v ' 
each template til, 11 2 is 1 adjusted back J and fbhrvuritil 
this distance is' mintmized for each corner. 
. [0076] Once the templates 111, 112 aire in their best- 
fit positions, the ahgfe 1 1 ^ 'between the comer pixels * ; 
115, 116 in the templates 111, 112 is determined as so 
shown in FIG. 10B in a manner well-known in the 'ahV • ; 1 
thereby yielding an est i mate r tof the skew angle d of the 
PanaMark 110. ■ - f- ; i.: 

[0077] Note that while the anchor bits in the PariaMark : 
110 may be skewed, the templates 111,112 are' not (i. 35 
e., the templates are pre'-defined, and'the skewangle is 
not a priori known).^Forthis reasorr the procedure is re- - 
liable only for smallske Wangles. The skew over a single • - 
bit is relatively minor" com (Dared to the ske w over the en- 
tire dimension of the : PanaMark 110: : Hence I 'the s tern- 40 
plates 111,112 will match well in their intended positions 
so long as the anchor bits in the image are not-sign'ifi- 
cantly distorted (i.e., the skew angle 0 is not todfarge).* 
[0078] The Hough Transformation can also b& used - 
to determine the skew angle for levels of- skew angle 
which the template matching procedure is 1 incapable of- 
handlingand for PanaMarks without anchor- bi'ts. : The / : 
Hough Transformation is one 61 the oldesPand most ro^ 
bust methods used in image- analysis and computer Vi- 
sion. It is a technique for detecting geometric features ! so 
such as straight lines, circles, etc 1 .,- in { a i digitaHmage. 
The Hough Transformation is a parametric transforma- 
tion which maps a set of points of a shape- into a small 
set of parameters characteristic of that shape. For 1 ex- 
ample, as shown in FIG. 11, points on a straight line 120 ss 
can be mapped to a pair of parameters (p, 0) by its par- 
ametric equation: r 



p = xcos(0) + ysin(G) 

where p, 121 , is the normal distance to the line from the 
origin, and 9, 122, is the angle of the normal of the line 
with the x axis. 

[0079] ' The above correspondence between all points 
on a straight line' and a parameter pair (p, 6) assumes 
that (p'; 0) of the straight line is known. As shown in FIG. 
12A, an arbitrary point (x, y) 127 in a plane, however, 
can be on ah' infinite number of straight lines, as depict- 
ed by lines 1 23a-T23d. In other words, a point (x, y) 127 
can be mapped to an infinite number of parameters (p, 
0). The set F xy = {(p; 0) I 0 < 0 < 180°, p= x cos(0) + y 
s/n(9)} 'represents all straight lines passing through (x, 
y) 127/ 

[0080] On the other hand, as shown in FIG. 12B, for 
any two points (Xq, y 0 ) 128 and (x,, y 7 ) 129 there is a 
straight line "i24 passing through both. Accordingly, 
there exists one pair of parameters (p*, 9*) common to' 
both set F^'^and set F x1 yV such that: ' 

[0081] Subsequently, if many points (x v y 7 ) in a plane 4 * 
form a straight line of parameters (p*, 0*), the parame-** 
ters shall be a member of all corresponding sets F xlyr " 
In other words; given a set of points in a plane, the ' 
Hough Transformation can be 1 used to detect the exist- 
ence cf a straight line of given parameters by calculating 
the frequency that the parameters appear in set F xjy f~ 
over all points. 

[0082] In practice, the line detection algorithm based 
upon the Hough Transform requires a 2-D accumulator- 
array representing the quantized parameter space (p;" 
0). To 'detect the lines, the procedure examines every 
foreground* pixel (x, y) in the image, and calculates all 
quantized values (p, 0) in the set F xy The resulting (p, 
0) entries in the accumulator array are incremented. 
[0083] After all pixels have been processed, the ac- 
cumulator array is; searched for high value peaks. These 
peaks indicate the parameters of the most likely straight 
lines in the image. The highest peak represents the pa- * 
rameters of the' "dominant" straight line in the image. 
[0084] As shown in FIG. 13A, when scanned perfect- 
ly, the edge pixels of a PanaMark 1 30 corresponding to 
the logical bits in a same row in a PanaMark 1 30 reside' 
on the same scanline 132. As shown in FIG. 13B, when 
the PanaMark 131 'is skewed, these edge pixels align- 
roughly in the direction of skew, and thus the edge pixels : 
form straight lines, such as parallel scanline 133 and 
perpendicular scanline 1 34 with respect to the orienta- 
tion of PanaMark 1 31 . Thus, the Hough Transform pro- 
cedure can be used to identify these straight lines, such 
as scan lines 133 and -134, and subsequently, deduce 
the skew angle of PanaMark 1 31 - r 
[0085] The first step of the skew detection procedure 
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based on the Hough Transform is to identify the edge 
pixels of within a PanaMark. The, black and white pat- 
terns within a PanaMark have four sides (called ."edges" 
herein). Figure 14A shows the four edges for a . black 
pattern 140, which are aligned in two orthogonal orien r s 
tations, i.e., horizontal edges 141 and vertical, edges . 

142. For the purpose of skew estimation, it is sufficient 
to detect edge pixels in only one orientation, for exam- 
pie, only horizontal edges 141. 

[0086] As shown in FIG. 14B, two edge, masks. 143, io 
146 are applied to a PanaMark pattern to identify the 
horizontal edge pixels. Although one edge mask would 
be adequate to perform skew estimation,, in the pre- 
ferred embodiment disclosed herein two edge masks- 

143, 146 are used to provide robustness to the identifi- , is 
cation procedure. Any black pixel locations whose black, ;V . 
and white pattern in vertical position matches, either of ., : 
the masks are considered edge pixels. The width of the . 
mask IV 145, 146 is preferably set to be one-half of the 
number of -pixels of a. PanaMark logical, bit in { the #o 
scanned image. , 
[0087] . FIG. 15A shows a scanned PanaMark 150 with... , 

a skew angle of 10° and FIG. 15B shows the horizontal 
edge pixels 151 detected in PanaMark 150. Once the 
edge pixels 1 51 are identified, the Hough Transform pro- ; 2S 
cedure is applied to detect the "dominant" straight line 
in edge image 1 52. The skew of the PanaMark 1 50 is 
then calculated as the skew of the dominant line in edge 
image 152. 

[0088] After the skew angle has been estimated, us- 30 
ing either of the procedures just described, it may nec- 
essary to apply a corrective deskewing.step. As with the 
other steps involved in reading a PanaMark, there are 
a number of different ways to accomplish this. 
[0089] As shown in FIG. 16, a "shear rotation" proce- 35 
dure can be used to deskew the PanaMark.for relatively 
small levels of skew. This is a one-dimensional approx- 
imation to a true, two-dimensional trigonometric rotation 
which would be more costly in terms of computing time 
to compute. Shear rotation works well for the kinds of 40 
small skew angles usually expected. If larger values are 
detected, the more complex and hence more time con- 
suming process described with respect to FIG. 17 must . 
be used. Two separate, one-dimensional steps are per- 
formed which amount to sliding, the appropriate rows is 
and columns of the PanaMark: a horizontal shear fol- 
lowed by a vertical shear. • ; 
[0090] The shear rotation process begins by assum- 
ing that the upper-left-most, pixel 1 63 of the PanaMark 
is correctly placed. Based on the skew estimate deter- so 
mined earlier, a compensating number of pixels, to shift 
each successive row is computed, working down from 
the top. Likewise, a compensating number of pixels to 
shift each successive column is computed, again, as- 
suming the upper-left-most pixel is correctly placed. The ss 
rotation is then performed by shifting first the rows, then 
the columns, as depicted in FIGS. 16A and 16B. Note 
that FIGS. 16A and 16B combine the separate steps of 



calculating the shift with the actual rotation. The first 
step, as shown in the horizontal shear step, is to calcu- 
late working from the top the number. of compensating 
pixelsto be shifted for each row.of image 160. As indi- 
cated to the left of image ,160 in FIG. 16A, the top four 
rows require no, shift, the next two rows, require a one 
pixel shift, and the two bottom-most rows require a two 
pixel shift. The next step, as shown in the vertical shear 
step, is to calculate working. from the left the number. of 
compensating pixels to be shifted for each column of the 
original image 160., As indicated below image 161 of 
FIG. 1 6B, the first two, columns, require no sJiift, the next 
two columns require a one pixel, shifty and. the final two 
columns require, a twq pixel shift. .Next, the horizontal 
shear process is performed on image 160 by shifting.the 
lower four rows of pixels. Finally, the vertical shear proc- 
ess is performed, by shifting the right fou.r cplumns of . 
pixels by- the., calculated amounts^ giying.the final result 
shown in RIG. l6C...Jhe act uaL PanaMark js, only a ..6 x 
. 6 bit array^ ancUhe outer white regions : shpwn in Pan- 
aMark 162. will be eliminated during Jhe cropping step 
75 depicted in jFIG ; 8 and f discussed below. , 
[0091 J. , The example used in. FIG. 16 was chosen to 
illustrate,. the two shearing operations, but the ^coarse 
resolution of the bitmap and the large degree of skew 
result in a misleading amount ol, distortion in the output 
image, which does, not occur. in practice. .. . <; . 
[0092] . . As shown in FI G. 17, to implement the trigono- 
metric deskewing step 74 described in FIG. 8, the rela r 
tionship between a two-dimensional image .1,71 and its 
skewed (or rotated), counterpart 170 can be described 
as an affined transformation of coordinates by the well-, 
known equatiqns: t ■ : , 

* x 1 = >rcos8 - ysine ' (1) 

' " y = xsinG + ycose * (2) 

where (x, y) arid (x',y) are\he pixel coordinates in the 
originai image 171 and the skewed image 170, respecr 
tively, and ; 9 is, the skew angle. .*■.-- 
[0093] Consequently, once B is known, the original im- 
age ,1/1 ,can be recovered from its skewed version 170 
by the, inverse transformation: ■ 

yx~ ycos'e + y*sin6 (3) 

■ >r f/ y± - /sine + /cosQ ' (4) 

[0094] A straightforward way of implementing skew 
correction is to compute the deskewed image 171 coor- 
dinate (x, y) for each black pixel (*',/) in the input image 
170 using the inverse transformation (equations (3) and 
(4)), as shown by the dotted line 173 in FIG. 17. Each 
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pixel (x, y) is then set to the be the color of pixel (x\ y). 
[0095] Alternatively, the operation can be done in the 
opposite way: for every pixel (x, » in the goal image, its 
position in the input irriage (x'y) is computed via ecjua- ' 
tions (1 ) and (2), as shown by the dotted line 172 of FIG.' s 
17. Tne pixel color is then copied from the computed 
location/ to {x, y). * 

[0096] : * Frdrri the';mathemdtical point of view, the two 
implementations' are equivalent; For digital images; 
however, they'are not identical. Both methods' have their to 
shortcomings*. Due 7 tonumerical round-ups, the first im- 
plementation may generate smaH'holes in the deskewed 
image, while the 'secondMmplementktion may result' in 
misplaced edges. Although 'each melhbd may provide 
satisfactory' results; the second 'method is the preferred • is 
method: \ ' : * 'y 7 ' " : i ^ v \ 1 ' " 

[0097] ' After the PahaMaVk has been deskewed, the 0 
bou nding box ca Ic u lated' with' respect 1 to 1 F I Gl 9 su r- " 
rounding the 'PanaMark may be too' "loose "/i.e. 1 , there r ' 
are more pixels within the bounding box than is needed 20 
to extract the data bits. For example; as shown in FIGS:" 1 
16A-16C, if the original bitmap measures 6 x 7 pixels 
and the final deskewed oitrnap measures '6' x* 6 pixels, > ' 
two columns and one row of pixels -must tfe discarded! "*•""' 
Hence, another set of operations much like the original - ^^5 
locate step described with respect to EIGT 9 is -per- : 
formed, with slightly tighter thresholds since L it isno'loncf ,:V 
er necessary to worry about chopping off thecdrhers : of • 
the PanaMark: < * . <••" : ^ , 

[0098] At this point] the scanned PahaMark 'hWs been 30 
located, deskewed, and tightly cropped. The- next step ••- 
is to read out the data bits, which transforms' the Pan- 
aMark from the image domain, where each bit is repre- 
sented as a collection of black or white pixels, to a 20 x 
20 bit array, in the preferred embodiment, of logical val- 35 
ues. Note that since the PanaMark symbology is clock- 
free, there are no predetermined reference patterns to 
help orient the reading process., However, the logical 
size of the PanaMark is known in advance, for example, 
a square measuring 20 bits on each side in the preferred 40 
embodiment. Moreover, because the bits in the mark are ' * 
pseudo-randomized during the encoding process; any 
particular row or column of pixels will show a higher dis- - * 
tribution of black-white and white^black transitions near 
the edges in the logical rows and columns, and a lower ^5 
distribution near the centers. This process is fully de- 
scribed in the '280 Application. Once the horizontal and 
vertical center lines have been established by the proc- 
ess described in the '280 Application, the bits are read 
out of the PanaMark by recording the pixel value lying so 
at the intersection of each horizontal and vertical center 
line (for example, setting each "white" pixel value = "0" 
and each "black" pixel value = "1 "). 
[0099] As shown in Figure 18 A, the basic clocking 
process corresponds to starting the NW (upper-left) cor- ss 
ner of PanaMark 184 and reading out the data in a left- 
to-right, top-to-bottom fashion. As shown in RGSr* 18C- 
18D, the same paradigm can be applied, though; begin- 



ning at any of the other three corners. (SW, NE, SE) as 
well. In theory, the output from these four alternate 
clockings should be the same. However, because of 
scanning variations and round-off errors, the results can 
in fact be different. However, based on feedback from 
the ECC step, there is a choice of using the best of the 
four different readings, made in the four alternate direc- 
tions described in FIGS, 18A-18D. In other words, if the 
ECC reports two errors detected during the left-to-right, 
top-to-bottdm clocking, while the other clocking all re- 
turn 1 5 errors, then the first "set of results should be used 
and the remainder discarded: This has the effect of re- 
ducing the overall error rate with high probability 
[0100] While the present invention has been particu- 

' larly' shown and described with reference to the pre- 
ferred embodiments and various aspects thereof, it will 
be appreciated by those of ordinary skill in the art that 
various changes and modifications may be made with- 
out departing from the spirit and scope of the invention. 

1 It is intended that the appended claims be interpreted 
as including the embodiments described herein, the al- 
ternatives mentioned above, and all equivalents thereto. 

'Claims 

1. A method of encoding information for printing on a ^ 
printed medium, comprising the steps of: 

' i converting said information into a data stream 

- ' having a sequence of ? 0 Bi and "1" characters* 

representing data bits; ! , ; • ■ . ■. ~> 

^randomizing said data stream to produce a ran-* 
domized data stream having a sequence of M O n * 
' and "T" characters; 

'* formatting the randomized data stream into a 
-■' two-dimensional bitmap of rows and columns 

' ' including only the randomized data stream bits; 

J ' and 

printing the formatted bitmap onto said printed 

mediumV ' • T 

- - . j ' 

2. The method as 'in Claim 1 , wherein said formatting 
step includes the addition of an anchor bit into at 
least one corner of said two-dimensional bitmap. 

3. The method as in Claim 1 or 2, wherein said format- ■ 
ting step includes the addition of an anchor bit into" 
each corner of said two-dimensional bitmap. 

4. The method as in any one of Claims 1 -3, wherein 
said formatting step includes the step of mapping 
the randomized data stream such that each data bit 
within each block in said data stream is separated 
from every other data-bit within the same block by 
at least one row and one column in said two-dimen- 
sional bitmap. ' * * 



11 



BNSDOCID: <EP 0962883A2_I_> 



21 



EP0 962 883 A2 



22 



5. The method as in any one of Claims 1-4, wherein 
said randomizing step comprises the step of repeat- 
edly randomizing said data stream by alternate des- 
ignated keys until the randomized data stream con- 
tains a substantially uniform distribution of "O" and 
°1 ■ characters. 

6. The method as in any one of Claims 1-5, further 
comprising the step of compressing said informa- 
tion. 

7. The method as in any one of Claims 1-6, further 
comprising the step of adding error correction code 
data bits to said data stream. 

8. The method as in any one of Claims -1-7, further 
comprising the step of duplicating said data stream 
to produce redundant data for improving reliability 
upon encoding] 

9. An information storage medium comprising a two- 
dimensional data barcode printed on a printed me- 
dium comprising a bitmap formatted representation 
of randomized encoded information having a prese- 
lected bit row and column length. 

10. A method of decoding randomized information 
printed on a printed medium in the form of a bitmap 
of rows and columns of data pixels representing en- 
coded data bits, each of said data pixels being ei- 
ther a first or second color, comprising the steps of: 

scanning said printed medium to digitize said 
bitmap; 

formatting said bitmap to a pixel based gray- 
scale representation; 

setting the threshold intensity level based on 
said grayscale representation such that pixels 
above said threshold correspond to said first 
color and pixels below said threshold corre- 
spond to said second color; • 
determining the skew angle of said digitized bit- 
map; 

deskewing said digitized bitmap so that said 
skew angle is reduced to substantially zero; 
cropping said digitized bitmap; 
reading out binary data from said digitized bit- 
map in a predetermined pattern to produce a 
one-dimensional array of digital data; 
derandomizing said one-dimensional array, of 
digital data; and 

error-correcting the derandomized one -dimen- 
sional array of digital data to produce a sub- 
stantially error-free digital representation of the 
encoded information. 



in addition to said bitmap and said method includes 
the steps of formatting said bitmap and said other 
information to a pixel based grayscale representa- 
tion and determining the row and column bounda- 
s ries by sequentially comparing the number of first 
pixels in each row and column respectively to a pre- 
determined threshold. 

12. The method according to Claim 10 or 11, wherein 
10 said skew angle determining step.uses either a tem- 
plate matching procedure or a Hough-transform 
procedure. J' . * V.. V..- V 

1 3. The method according to any one of Claims 1 0-1 2, 
is wherein said deskewing step uses* either a shear 

rotation procedure or a^trigpnometnG-proeedure^ 

14. The method According to/any one of Claims 10-13, 
wherein said reading step reads said data in 

20 each of four alternate directions creating four alter- 
nate one-dimensional arrays, of digital data, and 
said error correcting ;st§pjseiects the one-dimen- 
sional arrays of digifai data with the least number of 
errors for output^ 1 ■ 
25 ' • ----- - - • - - >l 

15. The method according to any one of Claims 10-14, 
wherein said derandomizing step repeatedly de ran- 
domizes the or each said one-dimensional array of 
digital data by alternate designated keys until the 

30 errors detected by said error correcting step fall be- 
low a preset threshold. 

16. The method according to any one of Claims 10-15, 
' wherein said reading step comprises the steps of: 

35 ' ' 

determining the number of horizontal and ver- 
tical edges in each respective row and column 
of said bitmap; 

determining the row center lines from the local 
40 minima in the horizontal edge counts; 

. determining the column center lines form the lo- 
. , : -k :*ca\ minima in the vertical edge counts; 

, V and 'reading out binary data located at the in- 
l ' \ ( s v r tersection of each row center line and column 
^ " * :S : -center, line in a predetermined pattern to pro- 
.. * .\ duce a one-dimensional digital representation 
' V , r; of said digitized bitmap. 




15 



20 



25 



11. A method according to Claim 1 0 wherein said print- 
ed medium has other information recorded thereon 
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FIG. 8A 
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FIG. 8B 
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FIG. 13A FIG. 13B 
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FIG. 15A FIG. 15B 
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